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The signal-to-noise ratios of different speech-encoding schemes have 
been measured in the case where the channel contains errors. Those types 
and probabilities of errors have been considered that are of interest for 
mobile telephone applications. Most encoding schemes use an adaptive 
three-bit quantizer with an explicit transmission of the step-size informa- 
tion. A scheme with an adaptive prediction algorithm has also been studied. 
It has been assumed in all cases that the side information about the quan- 
tizer step size and the predictor coefficients is transmitted in an error- 
protected format. 

Measurements were made by simulating the coding schemes and the 
noisy channel on a digital computer. The results include upper bounds of 
the improvements that can be reached with error protection of the most 
significant bits. 

I. INTRODUCTION 

The suitability of digital coding and transmitting speech signals 
for mobile telephone systems is a question of current interest. In uhf 
systems, Rayleigh fading causes the carrier-to-interference ratio and 
the carrier-to-noise ratio to be low in frequent intervals. This leads 
to high bit-error probabilities in the transmission of the coded signal ; 
the errors occur in bursts. 

This paper compares the effects of channel transmission errors on 
the objective signal-to-noise ratio (s/n) for different encoding schemes. 
Most of these schemes use an adaptive quantizer with time-varying 
step sizes or, equivalently, a time-varying gain control of an amplifier 
in front of a quantizer with fixed step sizes. The side information about 
the step size (or about the amplifier gain) is derived from stored 
samples of the input signal and has to be transmitted together with the 
message block of coded samples (adaptive quantization with forward 
estimation = aqf). These aqf schemes have an excellent idle channel 
performance, even in the presence of channel errors, if the side informa- 
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tion can be transmitted in an error-protected format. Previous 
studies 1-3 always assumed a nonadaptive ^-logarithmic quantizer with 
its specific problems of allowable peak clipping and changing per- 
formance caused by different mean levels of the speech signal. The 
purpose of the present study is to show the s/n performance of some 
adaptive speech-encoding schemes suitable for mobile telephone ap- 
plications. Our measurements were made by simulating the coding 
schemes and the noisy channels on a digital computer. The measure- 
ments include upper bounds for the s/n that can be reached by using 
error-protection schemes to reduce the effective channel error proba- 
bility. We have studied independent, as well as clustered, channel error 
patterns. No attempt was made to study practical error-detection 
or error-correction schemes. However, the results allow us to predict 
the overall performance that can be reached with nonideal error- 
correction schemes. The s/n values of the coding systems with inde- 
pendently distributed channel errors were measured because it is 
possible to produce a nearly equivalent error pattern by scrambling 
the bit stream at the output of the transmitter. 6 Additionally, inde- 
pendent errors will be the primary impairment if the interference is low. 

The organization of this paper is as follows. Section II discusses the 
dependence of the total s/n on the bit-error rate P, assuming gaussian- 
distributed quantizer input data. It has been shown 4 that the gaussian 
probability density function is a good approximation for signals oc- 
curring in speech aqf schemes. Both the natural binary code (nbc) 
and the folded (symmetrical) binary code (fbc) are considered. It is 
shown that the fbc code has a better s/n performance if channel 
transmission errors cannot be ignored. Section III discusses those 
speech-encoding systems used in this study, and Section IV considers 
the types of errors on the channel and gives the main results obtained 
by simulating the encoding schemes and the noisy channels on a 
digital computer. Comparison is made on the basis of the objective 
overall signal-to-noise ratio. Tape-recorded examples were used to 
compare the subjective and perceptual effects of signal-quantization 
and channel errors. Some conclusions are given in Section V. 

A detailed comparison of various speech-encoding schemes on the 
assumption of an error-free transmission is described in another paper. 6 
A companion paper by Jayant 6 gives numerous examples of the per- 
formance of practical error-protecting schemes in the presence of 
channel errors. 

II. DERIVATION OF THE SIGNAL-TO-NOISE RATIO FOR GAUSSIAN SIGNALS 

We calculate the overall signal-to-noise ratio of a three-bit pcm- 
quantization scheme on the assumption that a speech signal can be 
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Fig. 1 — Digital transmission system. 

represented by a gaussian probability density function (pdf) at the 
quantizer input. This assumption is approximately valid if adaptive 
quantization with forward estimation is applied to the speech samples 
and if the message blocks are not too long. 4 Figure 1 shows a block 
diagram of the system under consideration. The quantizer with M = 8 
steps (and m = log 2 M bits per code word) maps each input sample x (n) 
into one of a set of eight rational numbers y (n)e{ v k \k=i, 2,..., m- The 
representation level w, is chosen if w,+i ^ x(n) > Ui, as illustrated in 
Fig. 2. The index i of the input symbol v t of the transmission system 
is transmitted to the receiver in a binary format [binary code word 
y c {n)~\; the received code word z c (n) is interpreted as one of the eight 
output symbols z (ri)e{w k }k=i, 2.---. m- We obtain a change 5,y = \v { — Wj\ 
in amplitude if the transmitted quantizer index i is changed to j 
because of channel errors (Fig. 3). The total mean-squared error is 
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Fig. 2 — Symmetric nonuniform quantizer, m = 3 bits, M = 8 steps, nbc = natural 
binary code, fbc = folded binary code. 
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given by 



(1) 



e? = E[x{n) - z(n)J 

= e 2 4- e 2 

where the quantization error e 2 g and the channel error e 2 are given by 
e 2 = Elx(n) - y(n)J (2) 



e 2 e = E[_y{n) - z(n)J. 



(3) 



Equation (1) is only true on the assumption of a vanishing correlation 
between quantization error and channel error; 7 this is the case if the 
quantizer structure is that of Max 8 (these quantizers lead to a maxi- 
mum s/n performance for a given pdf). 

The mean-squared error caused by digital line errors is 



With 

we have 



M M 

«"=1 j=l 



P( Vi ,wj) = P(v i )-P(w j /v i ), 



M M 

= trace {P„-P c -S 2 }, 



(4) 
(5) 



(6) 



where 

P(v i} Wj) 



is the joint probability of an input symbol i>, at the 
transmitter and an output symbol Wj at the receiver, 
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5,7 = \vi — Wj\ is the amplitude of the error occurring if the input 

symbol v.- has been chosen at the transmitter and if 

the output symbol Wj has been interpreted at the 

receiver, 

P(vi) is the probability of input symbol Vi occurring, 

P{vjj/vi) is the conditional probability that the output symbol 

Wj will be received if the input symbol i>, is sent, 
P v is a diagonal matrix with elements P («,), 

P c is the channel transition matrix with elements 

P(wj/vi), and 
5 2 is the matrix of squared error amplitudes with ele- 

ments Stj. 

The conditional probabilities P(wj/Vi) can be calculated easily if the 
bit errors on the channel are distributed independently. The values 
depend on the bit-error probability P and on the code. The probability 
P(iOj/vi) that Wj will be received if Vi is sent is just the probability that 
digital errors will occur in the D places where they differ and that no 
errors will occur in the m — D remaining places, 

P(wj/ Vi ) = P D (l - P) m ~ D , (7) 

where P is the bit-error probability on the channel, D is the Hamming 
distance between the code words representing the symbols y, and wj, 
and m is the number of bits per code word. The Hamming distance D 
depends on the code ; two codes have been considered (see Fig. 2) : 
(i) the natural binary code (nbc), (it) the folded binary code (fbc). 
For this code, the most significant bit gives polarity information; the 
remaining bits represent the signal magnitude in natural binary code. 
For example, the transition from input symbol Vi to output symbol 
ws = v& causes an error 8is in amplitude. The Hamming distances are 
D = 3 and D = 1 for the nbc code and fbc code, respectively (see 
Fig. 2). Using (7) we get P(w»/vi) = P 3 with the nbc code and P(ws/vi) 
= P(l - P) 2 with the fbc code. 

The mean-squared error caused by digital line errors can also be 
calculated if (ideal) error-protection schemes are applied : 

Scheme ep 1 : The most significant bit of each code word is perfectly 
error-protected. 

Scheme ep2: The two most significant bits are perfectly error- 
protected. 

The channel transition matrix P c has to be modified in these cases 
because some elements of the matrix are zero then. These necessary 
modifications are not described in this paper. Using (1) and (6), we 
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obtain the signal-to-noise ratio 



a\ 



A 



where 



s/n = lO-logio-4 = 10-logio J + trafle {p f .p - .p] 



a\ = £?[x»(n)] 



(8) 

(9) 



is the mean-squared power of the input signal. The normalized 
quantization noise variance e 2 Q /<rl has a value of 0.03451 if the structure 
of the three-bit quantizer is that of Max. 8 Using this value, the s/n per- 
formance has been calculated as a function of the bit-error rate P 
(Fig. 4). The two lower curves show that the fbc code outperforms the 
nbc code. The folded binary code has therefore been used in all simula- 
tions. The upper curves demonstrate the advantage of an (ideal) 
error protection of the most significant bit (epi) and of the two most 
significant bits (ep2). It may be relevant to mention that this error 
protection leads to a reduction of the effective bit-error rate. To con- 
firm the theoretical results, we have measured some s/n values with 
the simulation program that has been used for the study of the speech- 
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Fig. 4 — Signal-to-noise ratio performance of a gaussian data transmission system. 
nbc = natural binary code, fbc = folded binary code, epi = error protection of 
most significant bit, ep2 = error protection of two most significant bits. 
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Table I — Channel coefficients of optimum three-bit Gauss 

quantizers. NBC = natural binary code, 

FBC = folded binary code. 
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a 3 


Optimum nonuniform quantizer 

NBC 
FBC 

Optimum uniform quantizer 

NBC 
FBC 


6.9124 
5.7445 

7.2113 
5.5672 


-2.7373 
-0.4015 

-3.3611 
-0.0128 


-0.3133 
-0.3133 

0. 
0. 



encoding systems. The results obtained with gaussian input data and 
the fbc code compare favorably with those determined using (8) (see 
Fig. 4). 

The channel error variance e? as given in (6) can be transformed into 



4- £« r P>. 



(10) 



The coefficients a ; contain the total information about the effects of 
channel errors on the performance of an encoding-decoding scheme. 
Thus, different schemes can be compared easily. Table I lists the 
ocj coefficients of optimum uniform and nonuniform gaussian three-bit 
quantizers. Note that there are only very small differences between 
nonuniform and uniform quantizers. Note also that the fbc code 
should be chosen; it has a nearly 1-dB advantage over the nbc code 
if the bit-error rate is very high (see also Fig. 4). 

In the discussions so far, a quantizer has been assumed that is 
optimum (in the sense of a maximum s/n) if the channel is error-free. 
A higher overall s/n performance can be reached, however, if the 
quantizer is reoptimized for a given channel transition matrix P c . 9 
Figure 5 shows an example with a three-bit quantizer optimum for a bit- 
error rate of 0.025 and the nbc code. In fact, we get a better s/n per- 
formance if the bit-error rates P are high, but the shortcoming is the 
decrease in s/n for low P values; this reduction is approximately 2.3 
dB in the error-free case (P = 0). The reoptimization of the quantizer 
can therefore only be of interest if the channel noise statistics are time- 
invariant, or if the s/n decrease for small error rates can be accepted. 
This will be the case if a quantizer with a higher number of step sizes is 
chosen. 

The principal aim of this section was to show the calculation of the 
s/n of a (nonadaptive) pcm scheme if channel errors are present. These 
calculations can be extended to dpcm systems and to burst error 
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Fig. 5 — Signal-to-noise ratio performance of two nonuniform gaussian three^bit 
quantizers, (a) Quantizer is optimized for an error-free transmission, (b) Quantizer 
is optimized for independent channel errors of rate P = 0.025. nbc code has been 
used in both cases. 

channels. But it seems very difficult to find theoretical solutions for 
coding schemes that apply adaptive quantization and adaptive pre- 
diction strategies. Instead of looking for such solutions, we simulated 
different encoding schemes and channels on a digital computer and 
measured the overall s/n. We shall find some similarities to the results 
we obtained in this section. Particularly, we shall use the channel 
coefficients a, to compare the performance of the coding schemes. 

III. DESCRIPTION OF THE CODING SYSTEMS 

To get a good quality of the coded speech with low bit rates, we have 
used pcm and differential pcm schemes that employ adaptive quantizers. 
Both nonadaptive prediction (dpcm) and adaptive prediction (adpcm) 
have been applied. The advantage of adaptive quantization is that 
the quantizer is always adjusted to the highly variable variance of the 
speech signals. Thus, a better s/n performance is achieved. 4 - 10 dpcm 
and adpcm schemes provide an additional s/n gain over pcm; this is 
especially true if the predictor responds to changes in the short-term 
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spectrum of speech (adpcm). 10 ' 11 A nonadaptive logarithmic companded 
pcm has been included in our study because it very often serves as a 
standard reference in coder comparisons. A great number of speech- 
encoding schemes have been compared in a companion paper 5 on the 
basis of s/n as performance measure using the same speech signal 
employed in this paper. The effect of channel errors on s/n performance 
has been studied using the following encoding schemes : 

Scheme 1: PCM, nonadaptive (Fig. 6a). The quantizer has a ^100 
characteristic, 12 and the loading is four times the standard deviation 
of the speech signal to be quantized. 

Scheme 2 : PCM-AQF (Fig. 6b). Thirty-two samples of the input signal 
are buffered, and the maximum value of this block determines the 
gain of the amplifier in front of the quantizer (adaptive quantization 




(a) 




Fig. 6 — Speech-encoding schemes, (a) Nonadaptive pcm. (b) pcm with adaptive 
quantization (pcm-aqf). (c) dpcm and adaptive dpcm (adpcm) with adaptive quantiza- 
tion (dpcm-aqf and adpcm-aqf). q = quantizer, gc = gain control, B + gce 
= buffer and gain and coefficients estimation. 



SPEECH-ENCODING CHANNEL ERRORS 1623 



with forward estimation = aqf). The characteristic of the quantizer 
is optimum for signals with a gaussian probability density. 

SchemeS: DPCMl-AQF (Fig. 6c). A predictor with one time-invariant 
coefficient is being used in connection with adaptive (forward estima- 
tion) quantization (aqf). Thirty-two samples of the input signal are 
buffered, and the maximum difference between neighbored samples 
determines the gain of the amplifier in front of the quantizer. The 
characteristic of the quantizer is optimum for signals with a gaussian 
probability density. The predictor coefficient that is optimum for the 
speech signal being used is hi = 0.85. Lower values lead to a better 
performance of the dpcm scheme in the case of high error probabilities ; 
this will be shown in Section V. 

Scheme £: ADPCMl-AQF (Fig. 6c). In this adaptive prediction scheme, 
32 samples of the input signal are buffered, the normalized short-term 
correlation coefficient between neighbored samples of this block is cal- 
culated, and the predictor coefficient is set to this correlation coefficient. 
The gain of the amplifier in front of the quantizer is determined by 
calculating an estimation value of the standard deviation of the differ- 
ence signal ; the amplifier gain is set to the inverse of this estimation 
value. The characteristic of the quantizer is optimum for signals with 
a gaussian probability density. 

Scheme 5: ADPCM4-AQF (Fig. 6c). In this adaptive prediction scheme, 
four optimum predictor coefficients are calculated for each segment of 
32 samples from the first values of the short-term autocorrelation 
function ; see the description of Scheme 4 for further details. 

The folded binary code (fbc) was used in all cases. It should be 
mentioned that the combination of controlled amplifier and fixed 
quantizer in the adaptive quantization schemes is equivalent to a 
quantizer with a step-size adaptation. Some adaptive quantization 
schemes that use the transmitted code words for the control of the 
amplifier gain (adaptive backward estimation) have also been studied. 
The simulations have shown that these schemes cannot be used for 
channels with high bit-error probabilities ; the overall s/n turned out 
to be less than dB in most cases. 

IV. ERROR PERFORMANCE: RESULTS AND DISCUSSION 
4.1 Simulation system and types of errors 

The dependence of the overall signal-to-noise ratios of five speech 
coding schemes (see Section III) on the average bit-error probability 
P has been determined for different types of noisy channels. The s/n 
is given by 

s/n(P) = 101og 10 4> 01) 
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where e? and a\ are defined in (1) and (9), respectively. The s/n values 
have been measured for bit-error rates of 0, 0.001, 0.0125, 0.025, and 
0.05. The measurements were made by simulating the coding schemes 
and the noisy channels on a digital computer. Channels with inde- 
pendent, as well as correlated, error patterns have been studied. The 
statistically independent errors have been generated by using a pseudo- 
random noise generator program. Tape recordings with error patterns 
of actual fading signals have been used in the channel simulation of 
burst errors. The error patterns are typical for uhf mobile radio trans- 
mission. The statistics of these errors are described in Ref. 6. In all 
simulations, it has been assumed that it is possible to transmit the 
information about the gain of the amplifier (aqf scheme) and/or about 
the predictor coefficients (adaptive prediction) without any error. 
Increased signal-to-noise ratios can be reached for a given P value 
using error-detection and error-correction schemes. In studying these 
error-protected cases, it has been assumed that all errors are corrected. 
Practical schemes will not always be able to correct all errors. There- 
fore, the s/n values given in this paper for the error-protected case 
represent an upper bound on the performances of error-protecting 
techniques. Two types of error correction have been studied : 

epI : Protection of the most significant bit ; this bit is the sign bit. 
ep2: Protection of the two most significant bits. Only changes to 

neighbored output symbols are possible in this case (if the 

quantizer has eight step sizes). 

A 2.3-second utterance ("the boy was mute about his task" ; female 
voice ; bandwidth 200 to 3200 Hz ; sampling rate 8 kHz) has been used 
in all simulations. 

4.2 Results 

The s/n performances of the coding schemes that have been de- 
scribed in Section III have been measured using three-bit quantizers 
and the folded binary code. Figures 7 to 11 show the measured depend- 
ence of the s/n on the average bit-error rate P in the case of burst errors. 
The lower, middle, and upper curves refer to the unprotected trans- 
mission and to the epI and ep2 error-protection schemes; note that 
the effective bit-error rate is reduced then. We show this using Fig. 9 
as an example. The s/n value for P = is due to the quantization 
noise only. The lower curve shows a considerable loss in s/n for high 
bit-error rates P. An increase in s/n can be obtained by protecting the 
most significant bit (epI; middle curve) or the two most significant 
bits (ep2; upper curve). For example, if P = 0.025, the s/n value 
without error protection is 10.4 dB. A value of 14 dB is obtained with 
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Fig. 7 — Signal-to-noise ratio performance of a three-bit log pcm scheme in the 
presence of correlated errors. 

the protection of the most significant bit (epI). The effective bit-error 
rate is reduced to f X 0.025 = 0.0167 in this case because | of the 
errors are assumed to be corrected at the receiver. The 14-dB value of 
the epI curve is 2 dB higher than the s/n value we get for P = 0.0167 




0.02 0.03 

BIT ERROR RATE.P 



Fig. 8 — Signal-to-noise ratio performance of a three-bit pcm-aqf scheme in the 
presence of correlated errors. 
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0.02 0.03 

BIT ERROR RATE, P 

Fig. 9 — Signal-to-noise ratio performance of a three-bit dpcmi-aqf scheme in the 
presence of correlated errors. The predictor coefficient has a value h t = 0.6. 

on the lower curve (no error protection). We expect this result because 
no errors occur on the most significant bit for the effective 0.0167 bit- 
error rate of the ep l curve. An s/n value of 16.9 dB is obtained with the 
protection of the two most significant bits (ep2). The s/n value for 
the effective bit-error rate of J X 0.025 = 0.083 is 14.3 dB if the errors 
occur on all bits of the code words (lower curve) ; therefore, a 2.6-dB 
increase in s/n is due to the fact that only the least significant bits are 
affected. 

Error protection, however, is only possible by inserting redundancy 
into the code words. Let us assume that it is possible to obtain an 
error protection of the two most significant bits by using three re- 
dundant bits for each three-bit code word. The total bit rate is now 6 
bits per sample. Let us further assume that doubling the transmission 
rate causes doubling the bit-error rate (this is true for phase-modulation 
systems). Again using Fig. 9, we find an s/n value of 15.2 dB for 
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0.02 0.03 

BIT ERROR RATE.P 

Fig. 10 — Signal-to-noise ratio performance of a three-bit adpcmi-aqf scheme in the 
presence of correlated errors. 

p = 0.05 (ep2 curve). On the other hand, the s/n value without error 
protection is 10.4 dB for P = 0.025. Therefore, an improvement of 
nearly 5 dB over the transmission without error protection has been 
obtained. A similar discussion using the epi values shows that we get 
only a small s/n advantage then : an error protection of the sign bit is 
not sufficient for improving the overall performance. 

To better compare the performances of the coding schemes in the 
presence of errors, we have plotted the s/n values of these schemes with 
P as a parameter (Fig. 12). The s/n values for P = are due to the 
quantization errors only; the increase in s/n as compared to log-PCM 
starts with 7 dB (pcm-aqf) and goes up to 14 dB (adpcm4-aqf). At 
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Fig. 11 — Signal-to-noise ratio performance of a three-bit adpcm4-aqf scheme in the 
presence of correlated errors. 



higher bit-error rates, the different coding schemes have approximately 
identical performance because the output noise due to channel errors 
predominates over the quantization noise due to the quantizer. 

Recall from (1) that the total error variance can be expressed as the 
sum of the quantization error variance e\ and the channel error variance 
e 2 c if the mutual error is neglected. The term e 2 g can be determined from 
the s/n for P = 0; hence, we can separate the values e 2 c for the four 
bit-error rates P that have been used in the simulations ; these are the 
values P = 0.001, 0.0125, 0.025, and 0.05. The channel error can be 
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PCM PCM- DPCM1- ADPCM1- ADPCM4- 

AQF AQF AQF AQF 

h, = 0.6 

Fig. 12 — A comparison of the s/n performance of three-bit encoding schemes at 
different bit-error rates P (correlated errors). 



expressed approximately as 



e 2 c = a x P + « 2 P 2 



(12) 



because the third term of (10) can be neglected. 

The coefficients «i and a 2 have been determined using the measured 
data by searching for the minimum of the mean-squared differences 
between measured and calculated s/n values. These coefficients a\ and 
a 2 describe the effect of channel errors on the performance of the coding 
schemes. The total channel error variance is mainly determined by the 
channel error coefficient a.%. Figure 13 shows that the an values of those 
encoding schemes that use the same gaussian quantizer (all aqf 
schemes) are not very different. From this, we conclude that trans- 
mission errors are no more serious for dpcm and adpcm schemes than 
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for pcm; this has already been mentioned for dpcm in Ref. 2. The 
channel error performance is better for burst errors than for inde- 
pendently distributed errors. This has partly to do with the fact that 
some bursts appear in low-level parts of the speech sample. On the 
other hand, we cannot neglect the a 2 term in the case of burst errors ; 
a 2 is the coefficient of the P 2 term in (12) ; this term mainly represents 
the channel error contribution caused by two bit errors in a code word. 



TRANSMISSION WITHOUT ERROR-PROTECTION 
PROTECTION OF THE SIGN-BIT (EP1-SCHEME) 
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Fig. 13 — Channel coefficients ai and at. I = independent errors, B = burst errors. 
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This contribution is higher in the case of burst errors; it causes a 
stronger decrease in signal-to-noise ratio as can be seen in Fig. 14 
showing the s/n performance of a dpcmi-aqf scheme both for inde- 
pendent and correlated errors. We find the same tendency if we protect 
the most significant bit (epI; see Fig. 13) or the two most significant 
bits. We have used the average ai values to calculate the s/n increase 
if we apply (perfect) error protection : the increases are approximately 
3 and 11 dB for the epi scheme and ep2 scheme, respectively. Note, 
from Fig. 13, that the a 2 term can be neglected in the case of pcm 
schemes ; we therefore have a slower decrease in s/n at high bit-error 
rates. This can also be seen from Fig. 15, where we compare the s/n 
performance of a pcm-aqf scheme with dpcmi-aqf schemes that have 
different values of the predictor coefficient. Lowering this value, we 
obtain a higher channel error resistance, but pcm-aqf outperforms the 
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Fig. 14 — Signal-to-noise ratio performance of a three-bit dpcmi-aqf scheme in the 
presence of independent errors (I) and burst errors (B). The value of the predictor 
coefficient is hi = 0.6. 
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0.02 0.03 

BIT ERROR RATE, P 

Fig. 15 — Comparison of the s/n performance of dpcmi-aqf schemes with different 
values hi of the predictor coefficient with a pcm-aqf scheme. 

dpcmi-aqf scheme if the predictor coefficient is too low (note the 
different slopes of the curves at high bit-error rates). 

Our simulations involved not only three-bit quantization but also 
quantization schemes with a greater number of step sizes. Figure 16 
illustrates a typical example : the signal-to-noise ratios of the four-bit 
quantization schemes are nearly 6 dB higher than the signal-to-noise 
ratios of the corresponding three-bit quantization schemes if the 
channel is error-free. But this increase is lost in the presence of high 
channel error rates : all three-bit and four-bit systems have a similar 
s/n performance for high bit-error rates. Higher s/n values can only 
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Fig. 16 — Comparison of three- and four-bit encoding schemes. 



be reached by error protection, that is, by reducing the effective bit- 
error rate. 

V. CONCLUSIONS 

In this paper, we have determined the s/n performance of various 
speech-encoding schemes in the presence of high bit-error rates (up 
to 5 percent) ; both independent and correlated error patterns have 
been used. Some conclusions can be drawn from the quantitative results 
that have been obtained and from informal subjective listening tests. 

(i) It is possible, without pitch tracking, to quantize speech signals 
with three bits per sample such that the decoded signal is nearly indis- 
tinguishable from the original signal (adaptive prediction schemes in 
connection with adaptive quantization). A simple scheme with a fixed 
predictor (one coefficient) and an adaptive quantization can be chosen 
for a bit rate of four bits per sample. 

(it) Adaptive quantization lowers the idle channel noise and in- 
creases the s/n (the three-bit quantizer with a logarithmic character- 
istic has a very poor performance). 
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(iii) Only the adaptive quantization schemes (aqf schemes) with 
an explicit error-protected transmission of the step-size information 
can be used in the case of high channel error probabilities. 

(iv) The folded binary code (fbc) outperforms the natural binary 
code (nbc) for large bit-error rates. 

(y) Burst errors cause a stronger decrease in signal-to-noise ratio 
for large bit-error rates than independent errors. 

(vi) Transmission errors are no more serious for dpcm and adpcm 
schemes than for pcm. 

(vis) All coding schemes show approximately the same s/n per- 
formance for high bit-error rates, because the contribution of the noisy 
channel to the total error is much higher than the contribution of the 
quantizer. Therefore, a better s/n performance can only be reached 
by using error-protection schemes, not by increasing the number of 
quantizer steps. 

(viii) A high-quality decoded signal can be obtained with a protec- 
tion of the two most significant bits. An improvement in decoded 
signal quality can be realized even if a doubling of the bit-error rate 
(caused by the higher transmission rate) has to be tolerated. 

(ix) Significant-bit-packed codes that provide only protection of the 
sign bit (ep l scheme) are not very efficient. 

(x) Adaptive quantization schemes suppress the idle channel noise ; 
therefore, channel errors produce decoded noise only with very small 
amplitudes in silent intervals. This fact makes the decoded speech 
perceptually more pleasing. 

(xi) Nonadaptive and adaptive prediction schemes have a better 
perceptual quality than pcm schemes when bit errors occur on the 
channel. The power density spectrum of the error sequence is shaped 
in the dpcm or adpcm feedback loop such that the main contribution 
of the error is in the low-frequency range. This error spectrum is 
perceptually less objectionable. 

It is important to realize that the numerical results of this paper are 
based on a single speech record of one speaker. However, we expect the 
broad conclusions of this paper, as summarized above, to be true of a 
wide range of input speech material. 

VI. ACKNOWLEDGMENTS 

This work has been carried out in the Department of Acoustics 
Research during a summer visit. The author would like to express his 
very sincere thanks to many members of this department for their 
help, and especially to J. L. Flanagan as the head of this department 
for his support of this work. Special mention must be made of the 
benefit of stimulating discussions with N. S. Jayant and R. B. Kuc. 

SPEECH-ENCODING CHANNEL ERRORS 1635 



REFERENCES 

1. I. Dostis, "The Effects of Digital Errors on PCM Transmission of Compandored 

Speech," B.S.T.J., U, No. 10 (December 1965), pp. 2227-2243. 

2. K. Chang and R. W. Donaldson, "Analysis, Optimization, and Sensitivity Study 

of Differential PCM Systems Operating on Noisy Communication Channels," 
IEEE Trans, on Communications, COM-20, No. 3 (1972), pp. 338-350. 

3. I. Yan and R. W. Donaldson, "Subjective Effects of Channel Transmission 

Errors on PCM and DPCM Voice Communication Systems," IEEE Trans, on 
Communications, COM-20, No. 3 (1972), pp. 281-290. 

4. P. Noll, "Adaptive Quantizing in Speech Coding Systems," Proc. of the Inter- 

national Zurich Seminar on Digital Communications, 1974, pp. B3(l)-(6). 

5. P. Noll, "A Comparative Study of Various Quantization Schemes for Speech 

Encoding," B.S.T.J., this issue, pp. 1597-1614. 

6. N. S. Jay ant, "Step-Size Transmitting Differential Coders for Mobile Tele- 

phony," B.S.T.J., this issue, pp. 1557-1581. 

7. R. E. Totty and G. C. Clark, "Reconstruction Error in Waveform Transmis- 

sion," IEEE Traas. Information Theory (Correspondence), IT-13, 1967, pp. 
336-338. . 

8. J. Max, "Quantizing for Minimum Distortion," IRE Trans. Information Theory, 

IT-6, 1960, pp. 7-12. 

9. A. J. Kurtenbach and P. A. Wintz, "Quantizing for Noisy Channels," IEEE 

Trans, on Communications, COM-17, No. 2 (1969), pp. 291-302. 

10. N. S. Jayant, "Digital Coding of Speech Waveforms: PCM, DPCM, and DM 

Quantizers," Proceedings of the IEEE, 62, No. 5 (1974), pp. 611-632. 

11. P. Noll, "Untersuchungen zur Sprachcodierung mit adaptiven Pradiktions- 

verfahren," Nachrichtentechnische Zeitschrift, 27, H. 2 (1974), pp. 67-72. 

12. B. Smith, "Instantaneous Companding of Quantized Signals," B.S.T.J., 36, No. 

3 (May 1957), pp. 653-709. 



1636 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1975 



